Journal of Beijing University of Posts and Telecommunications

  • EI核心期刊

JOURNAL OF BEIJING UNIVERSITY OF POSTS AND TELECOM ›› 2010, Vol. 33 ›› Issue (4): 44-48.doi: 10.13190/jbupt.201004.44.zhangjp

• Papers • Previous Articles     Next Articles

Ensemble Classifiers Research for Classify High Speed Data Stream Based on Biased Sample

  

  • Received:2009-10-10 Revised:2010-01-14 Online:2010-08-28 Published:2010-05-21

Abstract:

High speed data stream brings the phenomenon that the data rate is higher relative to the ensemble classifiers computational power, so the ensemble classifiers cant train all data which reached recently to update themselves. An ensemble classifiers is proposed based on biased sample. Expectation error is analyzed through biased variance decomposition method, and the data is also biased sampled by computing all datas expectation error contribution degree which is waited for being sampled. This method can reduce time to train and update ensemble classifiers and will be contrasted with random sample ensemble classifiers. It indicates that this method has more prediction accuracy on condition the same proportion of sample.

Key words: data stream, ensemble classifiers, biased sample, bias variance decomposition